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Abstract. We present SimpleApprenant, a platform aiming to improve French 
L2 learners’ knowledge of Multi Word Expressions (MWEs). SimpleApprenant 
integrates an MWE database annotated with the Common European Framework of 
Reference for languages (CEFR) level and several Natural Language Processing 
(NLP) tools: a spelling checker, a parser, and a set of transformation rules. NLP 
tools and resources are used to build training and writing exercises to improve 
MWE knowledge and writing skills of French L2 learners. We present the user 
scenarios, the platform’s architecture, as well as the preliminary evaluation of its 
NLP tools. 
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1. Introduction 


MWE knowledge improves proficiency in writing (Paquot, 2018). Given that 
language learners have difficulties using MWEs, characterized by strong lexical 
preferences, syntactic constraints and non-compositional sense (Baldwin & 
Kim, 2010), learners should use them in the right context and should apply the 
correct morphosyntactic constraints. For instance, jeter 1’éponge ‘to abandon’, 
is the figurative sense and the determiner is singular, while jeter les éponges 
‘throw the sponges’, is used with the real sense. Collocations have strong lexical 
preferences (poser une question “put a question’, but not *demander une question 
‘ask a question’). Word-for-word MWE translation generally fails (passer l’arme 
a gauche ‘to kick the bucket’). 
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Existing online platforms for L2 learners (Language Muse, WritingMentor) 
provide few lessons dealing with English MWEs. For French, projects such as 
Base Lexicale du Francais (Verlinde, Binon, & Bertels, 2008) or DIRE Autrement 
(Hamel, 2010) represent MWEs’ morphosyntactic and semantic features or 
collocation usage (Schneider & Graén, 2018). However, these resources do not 
propose graded CEFR exercises, except for a few websites (e.g. Bonjour de France, 
Le Point du FLE). To fill this gap, SimpleApprenant proposes graded exercises, 
annotated with CEFR levels, for MWE learning. The learner’s level helps to select 
adequate content from the SimpleApprenant’s database. Moreover, the platform 
provides immediate feedback by automatic error correction. 


2. The SimpleApprenant platform 
and its scenarios 


SimpleApprenant? is an open-source Web platform, aiming to improve French 
L2 learners’ knowledge of MWEs and writing skills. For this purpose, the 
platform provides CEFR level-graded exercises for learning MWEs’ definitions 
and usage. Additionally, the platform corrects and transforms learners’ 
productions. SimpleApprenant integrates several NLP tools and resources: 
an MWE database, the spelling checker LanguageTool (Naber, 2003), and the 
parser Mind the Gap (Coavoux & Crabbé, 2017). We also developed a set of 
transformation rules, requiring parsed text as input, previously checked by 
LanguageTool. 


SimpleApprenant proposes three scenarios for learners, who freely register on 
the platform by indicating their CEFR level. In the first scenario, the learner 
should match MWEs (compatible with learners’ CEFR level) with the appropriate 
definition or gap-filling phrase. Thus, the learner learns MWEs’ definition and 
usage, by repeating these exercises and with positive and negative feedback 
(Figure | below). 


In the second scenario, the learner writes an essay, using at least one expression 
from a list of MWEs, labeled with the learner’s CEFR level. The teacher evaluates 
the essays and gives a manual feedback about the MWE usage in context. These 
exercises might be repeated by the learner, with new MWEs. 


3. https://simpleapprenant.huma-num. fr/Simplify YourFrench/accueil 


357 


Amalia Todirascu and Marion Cargill 


Figure 1. The learner matches MWEs and their definitions (first scenario). A 
green message is printed if the correct answer is selected, otherwise the 
right answer is printed in red 


Syinn) e)(-yANe) 0) aolarevalt 


Exercices 

Choisissez, parmi la liste proposée, l'expression correspondant a chaque 
Sondages définition. 
Déconnexion Si le contenu de I'exercice ne s'‘affiche pas, cela veut dire que nous ne 


pouvons pour le moment pas vous proposer cet exercice adapté a votre 
niveau. 


i 


. Avoir la liberté du mouvement des bras, des coudes. 
Réponse incorrecte! La bonne faire la claque v 
réponse est : avoir les coudées 
franches 
2. Etre payé pour applaudir en premier et entrainer les autres 
spectateurs. avoir les coudées franches =v 
Réponse incorrecte! La bonne > 
réponse est : faire la claque 


3. Avoir une bonne condition physique et mentale. 

Réponse correcte! avoir la forme v 
4. Donner suite, accepter, agréer, 

considérer avec bienveillance. faire droit v 


Réponse correcte! 
. Etre attentif (A quelque chose), remarquer (quelque chose). 
Réponse correcte! faire attention v 


uw 


The last scenario aims to improve learners’ writing skills. The learner feeds the 
texts to the platform, which are then processed by LanguageTool, the spelling 
checker integrated into SimpleApprenant. If necessary, the learner corrects the 
spelling errors identified by LanguageTool. Then, the corrected texts are parsed 
by Mind the Gap. If required, the learners apply one of the transformation rules on 
their parsed texts and receive the transformed text. This feedback should help the 
learners to avoid some grammar errors. 


SimpleApprenant is currently used by French language learners and their teachers 
from Opole University (Poland) (Al-C1 levels) and the University of Cyprus 
(A1-A2 levels). We have several CEFR levels, comparable target publics (native 
speakers of Polish or of Greek), and the possibility to follow the same students 
for several years. The teachers and students use the platform during classes as an 
additional resource, but also at home, mainly for MWE learning or for collecting 
written essays. The platform is used gradually from the first (A1-A2) to the third 
scenario (B2-C1), according to learners’ CEFR levels. 
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3: SimpleApprenant’s architecture 


For the three scenarios, SimpleApprenant uses a database of MWEs (partially 
annotated with the CEFR level), the spelling checker LanguageTool (Naber, 
2003), Mind the Gap parser (Coavoux & Crabbé, 2017), and 21 manually defined 
transformation rules, requiring parsed texts as input. 


We built the MWE database from Lexique-Grammaire (Gross, 1994) and from 
French vocabularies (Beacco, Bouquet, & Porquier, 2004). An MWE entry contains 
lemma, category (idiom, collocation), definition, gap-filling phrases (extracted 
from French Wiktionnaire), syntactic patterns, and CEFR level (Table 1). The 
CEFR level is automatically identified from a graded textbook corpus (Todirascu, 
Cargill, & Frangois, 2019) or manually assigned from reference textbooks (Beacco 
et al., 2004; Gonzalez Rey, 2007). 


Table 1. MWE examples 


MWEs Category CEFR Level 
Jeter l’éponge ‘abandon’ idiom Bl 
Faire la féte ‘celebrate’ collocation B2 
Faire attention ‘be careful’ collocation A2 


SimpleApprenant uses LanguageTool to detect spelling errors and Mind the 
Gap to create a dependency analysis of the corrected texts. The learner is asked 
to correct the spelling errors detected by LanguageTool. Then, the texts are 
parsed and the learner applies one of the transformation rules implemented in 
SimpleApprenant. Six deletion rules suppress adverbs and relative or participial 
clauses. Thirteen correction rules handle common mistakes such as verb 
agreement, determiner agreement, or negation errors. Complex transformation 
rules include passive to active voice or cleaved sentences transformed into a 
subject verb order structure. After the rule is applied, the learner consults the 
transformed text (Figure 2 below). 


The transformation of learners’ texts is a challenging task, due to erroneous 
input. Mind the Gap is a state-of-art French dependency parser: for unlabeled 
dependencies. For instance, the best Fl (harmonic mean of precision and recall 
measures) is 95.53% for reference data (Coavoux & Crabbé, 2017) but only 
83.12% for learners’ essays (obtained for 100 phrases of our corpus). We evaluated 
the transformation rules on 273 parsed sentences. Fifty-five (20.15%) were either 
not transformed or contained errors in the output. Out of those 55 sentences, 
34 sentences (61.82%) did not show any change and 21 sentences (38.18%) 
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were transformed but contained errors. Deletion (82.71%) and correction rules 
(74.72%) are more effective than transformation rules. The rules failed because 
of parsing errors (due to erroneous learners’ input) or syntactic patterns of the 
rule not matching the sentence. Even if some rules failed, agreement or negation 
errors are handled properly by the deletion and correction rules. As such, 
learners may still have feedback from these rules and see how their own text is 
transformed. 


Figure 2. The learners apply the rule adding a second negation particle pas to the 
original dependency tree for Je n’ ai oublié de le demander ‘I do not 
forget to ask it’, becoming Je n’ ai pas oublié de le mentionner dans 
mes messages 


root root 
oublié (VPP) oublié (VPP) 

suj mod aux.tps obj ponct suj mod aux.tps mod obj ponct 
Je (CLS) n’ (ADV) ai (V) de(P) . (PONCT) Je (CLS) n’ (ADV) ai(V) pas (ADV) de (P) . (PONCT) 

obj.p obj.p 

demander (VINF) demander (VINF) 
obj obj 
le (CLO) le (CLO) 


4. Conclusion and further work 


We present an online platform for French L2 acquisition, SimpleApprenant, 
including NLP tools supporting reformulation strategies. A large MWE database, 
annotated with CEFR level, is used to create exercises focusing on MWEs. The 
exercises are selected according to learners’ CEFR levels, generated with the help 
of preprocessing NLP tools: a spelling checker and a parser. The evaluation of 
transformation rules shows that some of them should be improved before being 
used by teachers and learners. We are currently revising the rules to improve 
the system’s feedback. The evaluation of the platform via Web questionnaires 
started with beginner and intermediate learners (Al-A2) and by teachers. The 
questionnaires ask the learners to classify the exercises by their difficulties and 
usefulness. The evaluation is still in progress and will be extended to higher 
learners’ levels (B2-C1). 
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